For N=1:
rank | frequency | n-gram |
---|---|---|
1 | 2559 | -ा |
2 | 2184 | -ी |
3 | 1607 | -र |
4 | 1561 | -न |
5 | 975 | -ल |

For N=2:
rank | frequency | n-gram |
---|---|---|
1 | 421 | -री |
2 | 333 | -ार |
3 | 311 | -या |
4 | 305 | -ला |
5 | 266 | -ान |

For N=3:
rank | frequency | n-gram |
---|---|---|
1 | 232 | -वाँ |
2 | 212 | -िया |
3 | 186 | -पुर |
4 | 108 | -ारी |
5 | 99 | -िंग |

For N=4:
rank | frequency | n-gram |
---|---|---|
1 | 61 | -रिया |
2 | 47 | -लें। |
3 | 47 | -ेला। |
4 | 35 | -ाबाद |
5 | 29 | -लैंड |

For N=5:
rank | frequency | n-gram |
---|---|---|
1 | 22 | -स्थान |
2 | 20 | -ेश्वर |
3 | 16 | -न्द्र |
4 | 15 | -स्तान |
5 | 15 | -ज्ञान |

The tables show the most frequent letter N-grams at the ends of words for N=1…5. The procedure parallels 2.2.5 Most frequent word beginnings; the aim here is suffix detection rather than prefix detection.
For N=3:
SELECT @pos:=(@pos+1), xx.*
FROM (SELECT @pos:=0) r,
  (SELECT COUNT(*) AS cnt, CONCAT("-", RIGHT(word,3))
   FROM words
   WHERE w_id>100
   GROUP BY RIGHT(word,3)
   ORDER BY cnt DESC) xx
LIMIT 5;
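The same counting can be sketched outside the database. The following Python version is not the query used to produce the tables; it is a minimal illustration that assumes the word list has already been loaded as a list of distinct word forms (one entry per row of `words`). The function name `top_endings` and the sample words are hypothetical. Like `RIGHT(word,N)`, the slice `word[-n:]` counts Unicode code points and returns the whole word when it is shorter than N, so Devanagari vowel signs and viramas each count as one letter, as in the tables.

```python
from collections import Counter


def top_endings(words, n, limit=5):
    """Return (rank, frequency, n-gram) rows for the most frequent word endings.

    words: iterable of distinct word forms (hypothetical input, one per word type)
    n:     length of the ending in code points
    limit: number of rows to return
    """
    if n < 1:
        raise ValueError("n must be at least 1")
    # word[-n:] yields the whole word if it is shorter than n,
    # which matches the behaviour of RIGHT(word, n) in the query.
    counts = Counter(word[-n:] for word in words)
    return [
        (rank, freq, "-" + ending)
        for rank, (ending, freq) in enumerate(counts.most_common(limit), start=1)
    ]


# Toy example with made-up words.
sample = ["walking", "talking", "singing", "runner", "winner", "cat"]
print(top_endings(sample, 3, limit=2))
# prints [(1, 3, '-ing'), (2, 2, '-ner')]
```

Calling the function with `n` from 1 to 5 would give one table per ending length. Ties are returned in first-seen order here, whereas the SQL query leaves the order of ties unspecified.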
2.2.5 Most frequent word beginnings